arXiv:l507.00717vl [q-bio.PE] 2Jul2015 


Stability of cooperation under image scoring in group interactions 


Heinrich H. Nax,' El Matjaz Perc,^’^ Attila Szolnoki/ and Dirk Helbing^’® 

^Department of Social Sciences, ETH Zurich, 8092 Zurich, Switzerland 
^Department of Physics, Faculty of Sciences, King Abdulaziz University, Jeddah, Saudi Arabia 
^Faculty of Natural Sciences and Mathematics, University of Maribor, Koroska cesta 160, SI-2000 Maribor, Slovenia 
"^Institute of Technical Physics and Materials Science, Centre for Energy Research, 

Hungarian Academy of Sciences, PO. Box 49, H-1525 Budapest, Hungary 
^Computational Social Science, ETH Zurich, 8092 Zurich, Switzerland 
^Risk Center, ETH Zurich, 8092 Zurich, Switzerland 

Image scoring sustains cooperation in the repeated two-player prisoner’s dilemma through indirect reci¬ 
procity, even though defection is the uniquely dominant selfish behaviour in the one-shot game. Many real- 
world dilemma situations, however, firstly, take place in groups and, secondly, lack the necessary transparency 
to inform subjects reliably of others’ individual past actions. Instead, there is revelation of information regarding 
groups, which allows for ‘group scoring’ but not for image scoring. Here, we study how sensitive the positive 
results related to image scoring are to information based on group scoring. We combine analytic results and 
computer simulations to specify the conditions for the emergence of cooperation. We show that under pure 
group scoring, that is, under the complete absence of image-scoring information, cooperation is unsustainable. 
Away from this extreme case, however, the necessary degree of image scoring relative to group scoring de¬ 
pends on the population size and is generally very small. We thus conclude that the positive results based on 
image scoring apply to a much broader range of informational settings that are relevant in the real world than 
previously assumed. 


Public goods provision iHll , common-pool resource man¬ 
agement [j], and other social dilemma situations often require 
large groups of individuals to make individually costly contri¬ 
butions toward a collective action, i.e., to cooperate 0. With¬ 
out sufficient consideration of future consequences, however, 
many real-world interactions lack the necessary cooperative¬ 
ness to prevent misalignment of private and social interests. 
This leads to socially undesirable outcomes. The result may 
be irrevocable mismanagement and over-exhaustion of shared 
resources, ultimately resulting in the ‘tragedy of the com¬ 
mons’ Sllli, which is a long-term outcome that is worse 
for everyone. 

Hence, the ‘puzzle of cooperation’ IMl. Why do certain 
social interactions flourish with high levels of foresight and 
cooperation, while others are im ped ed by short-sighted free¬ 
riding behaviour? Game theory 1112, H^ provides the neces¬ 
sary theoretical framework for studying the individual-level 
motivations ifldl] to answer this question. Indeed, game the¬ 
ory provides many answers, including a variety of evolution¬ 
ary ifl^H^ and psychological |fl7l - [l^ explanations, but only 
two based, in the language of game theory, on ‘rationality’. 

One of these two rational explanations is based on fore¬ 
sightedness. Namely, if individuals take future consequences 
sufficiently into account, then equilibria exist supported by 
more sophisticated, repeated-game strategies that overcome 
the short-term incentives to free-ride 11201] . Unfortunately, 
to uphold cooperation that way, individuals would need to 
commit to strategies in ways that may produce grim conse- 
quences if not everyone follows the strategy equilibrium path 
[|^ . Hence, groups consisting of sufficiently foresighted 
agents may succeed to guarantee provision of a common. 
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while groups with too many short-sighted or boundedly ra¬ 
tional agents will fail. The second rational explanation of 
the puzzle of cooperation is based on building a reputation 
from the past ll2^ . Essentially, players build a commitment- 
to-cooperation reputation through their past actions, and find 
each other, thus, outperforming defectors (who are stuck with 
other defectors), even though defection is the uniquely domi¬ 
nant selfish behaviour in the one-shot game ifldl] . As a result, 
if reputation matters sufficiently in determining with whom 
individuals interact, then cooperation can survive despite lim¬ 
ited foresight. 

Unfortunately, due to the inherent simultaneity of interac¬ 
tions Q, it is impossible to condition one’s own decision on 
the decisions of the others Q. One of the most important, 
and perhaps the simplest, reputation mechanism known in the 
literature to overcome this problem is image scoring ll^ . Fa¬ 
mously, image scoring can sustain cooperation in the repeated 
two-player prisoner’s dilemma through various forms of indi¬ 
rect reciprocity ll25l - lz^ . Under image scoring, agents learn 
who cooperated and who defected in previous interactions, 
and consequently condition their own actions on this informa¬ 
tion. Essentially, image scoring enables cooperators to find 
each other, and this overcomes the negative Nash equilibrium 
prediction of universal defection from the one-shot game. In¬ 
teresting^ image scoring has been shown to work in the lab¬ 
oratory 11281 - 1^ . but in general, it is considered to provide a 
relatively frail support to cooperative behaviour 11^l3^ . 

In fact, many real-world social dilemmas unfold in groups 
llHI, and it is unlikely that individuals will have access to oth¬ 
ers’ individual action histories Hi. Information that should 
be readily available, however, concerns the performance of 
the groups as a whole. Such information thus enables ‘group 
scoring’ as an alternative to image scoring. In particular, the 
image of an individual is no longer determined by its own past 
action, but by the performance of the group where an individ- 
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ual is member. More precisely, each player’s group score sum¬ 
marizes the aggregate cooperativeness of the groups where he 
was a member in the past, without any additional information 
regarding what the player did individually. Two important 
and previously unaddressed new questions emerge; (i) How 
do results related to image scoring generalize to group scor¬ 
ing?, and (ii) How sensitive are these results to information, 
that is, when image scoring constitutes a proportion p G [0,1] 
of information made available and the residual information is 
based on group scoring? 

The common feature of previous research on image scor¬ 
ing is that, over time, cooperators achieve higher scores and 
defectors achieve lower scores, and interactions are matched 
based on these scores such that thus cooperators play with co- 
operators and defectors play with defectors. In our paper, we 
build on this common feature by assuming the existence of a 
mechanism that assorts and matches players by their scores. 
In addition, we extent the scope of image-scoring-based mod¬ 
els by analysing the sensitivity of the results to the imperfec¬ 
tions of scores that ought to reflect individuals’ true past coop¬ 
erativeness rather than the overall performance of the groups 
where they are members. Our analysis continuously spans the 
worlds of two extremes; image scoring and group scoring. As 
we will show, image scoring, reflecting accurately individu¬ 
als’ past actions, works perfectly also in the generalized pris¬ 
oner’s dilemma game that is governed by group interaction. 
Conversely, group scoring fails, as it enables defectors to ef¬ 
fectively hide behind the cooperative efforts of others in the 
group. But how many true images are needed for cooperation 
to evolve in group interaction? In other words, what is the 
necessary proportion p G (0,1) of image scoring? It turns out 
that this depends sensitively on the underlying parameters of 
the interactions in ways that provide a formal basis for some 
of Ostrom’s conditions for successful common-pool resource 
management a. Key determinants are the rate of return, the 
size of the population, and the group size. Remarkably, for 
large populations only a ‘grain’ of image scoring is generally 
sufficient for cooperation to become dominant. 


Results 

We shall now formalize our arguments, generalizing step by 
step the two-by-two prisoner’s dilemma model due to 11^ to 
the more general context of the voluntary contributions game 
M. Suppose a population N = {l,2,...,n} plays the 
following game in rounds t = {l,2,...,oo}. Each player 
i G N, in period t, chooses a contribution c* from budget 
B = {0,1}. At the same time, each player i is associated 
with a binary score si = {0,1}, and players are matched 
into k groups S — {S'!,..., Sk} of a fixed size s = n/k ac¬ 
cording to the ranking of players’ scores (with random tie¬ 
breaking). After groups have formed, I’s resulting payoff 
turns out 4>l{c) = (1 - c-) -f r/s * where Sf is 

j’s group, r is the game’s fixed rate of return, and r/s the 
game’s ‘marginal per-capita rate of return’ that summarizes 
the underlying game’s synergy. We assume, as is standard, 
r G [l,s], that is, contributing a unit of budget is socially 


beneficial (yielding a sum total of payoffs to all players larger 
than one), but individually costly. 

We consider the following range of scoring mechanisms be¬ 
tween image scoring and group scoring. 

Image scoring: First, we formulate the equivalent of image 
scoring in our setup: at time t each player i has an 
image score, Sj^, known to every player which is based 
on decisions prior to t. In period f -f 1, if Es period-f 
contribution c* exceeded the average contribution c* = 
at time t, then his image score is one; if 
c* < cf then it is zero. Finally, if c‘ = c*, then s/^^ = 

Group scoring: Analogously, we formulate group scoring: 
at time t each player i has a group score, Sq^. Sup¬ 
pose Fs period-f group is S. If the contribution in S, 
exceeded the average group contribu¬ 
tion overall, cg = period-(f + 1) 

group score is one; if Cg < Cg then it is zero. Finally, if 
c% = 4, then s*+i = s^ 

Hybrid scoring: A hybrid between image scoring and group 
scoring in our setup means that, at time t, each player i 
has a hybrid score, known to every player which is 
based on decisions prior to t. In period f -f 1, if i’s score 
is updated according to image scoring with probability 
p, and according to group scoring with probability 1 —p. 

To summarize the three scoring methods, the types of in¬ 
formation necessary under the different regimes are as fol¬ 
lows. For image scoring, ex post individual-level information 
about contribution decisions is necessary, group-level infor¬ 
mation is therefore trivially also available. For group scor¬ 
ing, group associations and ex post group-level contributions 
need be known, individual-level information is not necessary. 
For the hybrid case, characterized by degree of image scoring 
p G [0,1], the probability that individual-level information 
rather than only group-level information becomes available 
must be larger than zero. 

The stability of cooperation under the different scoring 
rules can be evaluated. One result is that tragedy of the com¬ 
mons (resulting from universal defection) is a potential risk in 
all cases. The stability of universal defection under all scoring 
rules derives from the fact that unilateral defection is a best 
response under all scoring methods against a state of univer¬ 
sal defection. However, the relative stability of this worst-case 
outcome vis-a-vis a highly cooperative state critically depends 
on the scoring rule, mitigating this issue. 

Under image scoring, high levels of cooperation can be sta¬ 
bilized and then turn out to be more stable. This is the case 
if the proportion of cooperators with score one (matched in 
good groups) grows exactly at the speed so as to neutralize 
the shrinking of the proportion of cooperators with score zero 
(matched in bad groups). The defectors profit from the con¬ 
tributions of the latter group and achieve an average growth 
equal to that of the average cooperator. 

Stability is summarized with the results presented in Fig. [T] 
from simulations. The Methods section contains analytical 
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FIG. 1: Under image scoring cooperators always rise to complete 
dominance. Shown is the fraction of cooperators in dependence on 
the number of iterations. The inset shows the corresponding fraction 
of defectors on a semi-logarithmic scale to highlight their exponen¬ 
tial decay. Depicted results are averaged over 200 independent re¬ 
alizations to minimize fluctuations. Parameter values are: s = 5, 
n = 1500 and r = 4. 


proof of these results, as well as a description of the employed 
Monte Carlo simulation procedure. It can be observed that the 
state of complete cooperation is reached exponentially fast. 
We emphasize that this result is recovered independently of 
the value of s, r and n, and it is also robust against variations 
of the strategy adoption rule. Cooperation will always prevail 
under image scoring, as it allows cooperators to separate from 
defectors. In general, cooperators form homogeneous groups 
that provide them with a competitive payoff. Conversely, de¬ 
fectors must be content to form groups with their like, which 
provides them a null payoff. Cooperators can therefore eas¬ 
ily invade defectors, and they do so with a speed that is pro¬ 
portional to their number, which ultimately gives rise to the 
exponentially fast downfall of defectors. 

At first sight, such a state of cooperation may also seem 
a candidate for stability under group scoring. Inspection of 
the individual growth dynamics, however, reveals one cru¬ 
cial difference. Namely, cooperation states are not robust 
against the influx of defectors with score one. These play¬ 
ers outperform all others, which, jointly with the fact that 
score-zero defectors outperform score-zero cooperators, im¬ 
plies an above-average growth rate for defection vis-a-vis co¬ 
operation. In other words, the key difference between image 
scoring and group scoring is that defectors can only free-ride 
on the contributions of others under image scoring, while, un¬ 
der group scoring, defectors can free-ride on the contributions 
and scores of others. 

Figure|2]mirrors the set-up of Fig.[T]in terms of the simula¬ 
tion procedure, only that here group scoring instead of image 
scoring is used. It can be observed that, irrespective of the 
initial fraction of cooperators, they eventually die out. As by 
image scoring, this outcome too is robust against variations 
of s, r, n and the strategy adoption rule. Group scoring al¬ 
lows defectors to have the same high score as cooperators. 



FIG. 2: Under group scoring cooperators always die out. Shown is 
the fraction of cooperators in dependence on the number of iterations 
for two different initial fractions (see legend). Depicted results are 
averaged over 200 independent realizations to minimize fluctuations. 
Parameter values are the same as in Fig.[T] 


which in turn disables the separation of the two strategies into 
homogeneous groups. In agreement with the outcome of the 
public goods game in a well-mixed population, even a single 
defector can therefore eventually invade the entire population. 
Groups scoring thus completely fails to mitigate the tragedy 
of the commons. 

Since image scoring and group scoring could not be more 
different in their ability to stabilize cooperation, it remains of 
interest to determine the merit of hybrid scoring. While it 
seems reasonable to assume that sometimes the information 
about the past of each particular individual is readily avail¬ 
able, more often that not the scoring of an individual is pos¬ 
sible only indirectly through the achievements of the groups 
where s/he was member. We note that individual contributions 
in group efforts are notoriously difficult to pinpoint, which is 
also why the reciprocation to such efforts is quite a vague con¬ 
cept - if a group contains a cooperator and a defector, who 
do you reciprocate with lf3^ ? The question thus is, just how 
much individual-level information is needed to stabilize coop¬ 
eration? To answer this question, we introduce the probability 
p that a player’s score is determined by image scoring, while 
otherwise, with probability 1 — p, group scoring is used. All 
other simulation details remain the same as in Figs. [1] and |2] 

Results presented in Fig. [3 show that cooperation can 
evolve even at a very small p value, if only the population size 
is sufficiently large. The key for the stability of cooperation 
is for cooperators being able to recognize each other through 
their high scores, and thus to form homogeneous groups. The 
lower the value of p and the lower the value of r, the longer 
it takes for cooperators to segregate from defectors. Since co- 
operators are threatened by extinction, it is imperative that the 
segregation occurs before defectors take over. Accordingly, 
the lower the value of p and r, the larger the population size 
needs to be to warrant sufficient time to cooperators to segre¬ 
gate before they die out. A lower bound for pisp > 2In. Re¬ 
sults presented in Fig. |4] make these arguments quantitatively 
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FIG. 3: Under hybrid scoring, even a minute probability for access¬ 
ing the image score of an individual player suffices to stabilize coop¬ 
eration. However, the size of the population is crucial, as the segre¬ 
gation of cooperators from defectors takes the longer the smaller the 
value of p. Thus, if the population size is too small, cooperators are 
likely to die out before segregating into homogeneous groups. Seg¬ 
regation is also decelerated by low values r (see Fig.|4]for details). 
Shown is the fraction of cooperators in dependence on the number 
of iterations, as obtained for different numbers of groups forming the 
population (indicated alongside the lines). Depicted results are aver¬ 
aged over 50—200 independent realizations to minimize fluctuations. 
Parameter values are; p — 0.01, s = 5 and r — 1.1. 


more accurate. Evidently, the lower the population size, the 
higher the values of p and r need to be for cooperation to pre¬ 
vail. In small populations, there exist critical threshold values 
for both r (main panel) and r (inset), where drops to defector 
dominance are abrupt and occurring without precursors. 

We emphasize that the results concerning hybrid scoring 
mechanisms are independent of the group size as long as their 
number, and hence the population size, is sufficiently large, 
and they are also independent of the strategy adoption rule. 
This corroborates our main argument, which is that, regard¬ 
less of the scoring that is used, conditions need to be given 
for cooperators to completely segregate from defectors, i.e., 
to form homogeneous groups without a single defector. The 
identification of defectors has second-order importance. The 
key goal of scoring is thus to allow cooperators to recognize 
each other efficiently and to form homogenous groups accord¬ 
ingly. 


Discussion 

“The most important unanswered question in evolutionary 
biology, and more generally in the social sciences, is how co¬ 
operative behaviour evolved and can be maintained in human 
or other animal groups and societies” (Robert May in his Pres¬ 
idential Address to the Royal Society in 2005). A seminal ex¬ 
planation for the puzzle of cooperation is based on image scor¬ 
ing a mechanism that is both stunningly successful and 
stunningly simple. However, in its original formulation and 
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FIG. 4: Hybrid scoring requires a critical population size to stabilize 
cooperation under adverse conditions. Cooperators essentially com¬ 
pete to segregate from defectors before being completely wiped out. 
The lower the value of r (main panel) and p (inset), the larger the 
population size needed for cooperation to be stabilized. Note that for 
a sufficiently small population size (see legend), there exist sudden 
drops to zero cooperation levels at critical values of r and p. Shown 
is the stationary fraction of cooperators in dependence on r (main) 
and p (inset), as obtained for different numbers of groups forming 
the population (see legend). Parameter values are: p = 0.01 (main) 
and r = 1.1 (inset). 


application it came with the restriction to interactions 
that are pairwise to informational environments that allow a 
complete tracking of individual-level information. 

In the real world, these restrictive assumptions may not be 
germane. Instead, cooperation may involve several groups 
of individuals, and ex post information regarding individual- 
level cooperativeness may only percolate imperfectly through 
group-level information. Our focus in this paper has been to 
determine robust theoretical predictions regarding the emer¬ 
gence and survival of cooperation in such situations. The 
presented results have rather important implications. One is 
negative. Namely, when individual-level information is not 
available, cooperation cannot spread. But there is a silver lin¬ 
ing. When there is at least a ‘grain’ of individual-level in¬ 
formation, this may suffice for cooperators to find each other 
and form groups that are impervious to an invasion by defec¬ 
tors. We have shown that the spread of cooperation is robust 
to extensive imperfections in image scoring, thus extending 
the domain of environments where we should expect flourish¬ 
ing cooperation levels based on this established mechanism 
that fosters indirect reciprocity. Factors that affect the effec¬ 
tiveness of hybrid scoring negatively and positively are group 
size and population size, respectively. The rate of return was 
shown to not matter. 

There is one important component that the model we con¬ 
sidered here externalises, namely the issue of how a hierarchy 
of scores translates into an analogous group formation hierar¬ 
chy. These are questions future work should address. In par¬ 
ticular, we need to address how mechanisms known to play 
such a role in the context of image scoring would translate 
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into the informational setting considered here, and how such 
mechanisms may be designed. Little is known in this direc¬ 
tion. Economic experiments iIt^ [sH] might be particularly 
conductive to such research and guide future theoretical work 
to relevantly address these fundamental dilemmas of human 
cooperation. 

Methods 

Simulation procedure 

The employed Monte Carlo simulation procedure re¬ 
quires the iteration of the following three elementary steps. 
First, two randomly selected players i and j play one instance 
of the public goods game in their current group, thereby ob¬ 
taining payoffs (j)i and (f>j, respectively. Next, player j adopts 
the strategy of player i with the probability given by the Fermi 
function W = 1/{1 + exp[(0j — (j)i)/K]}, where K = 0.1 
quantifies the uncertainty by strategy adoptions Bo|] . Each full 
Monte Carlo step gives a chance for every player to change its 
strategy once on average. The reported fractions of coopera¬ 
tors and defectors were determined in the stationary state. 

Stability analysis 

At a given time t, there are four action-score pairs; 
cooperate-one (c* = s\ = 1), defect-one {c\ = 0 and 
s\ = 1), cooperate-zero {c\ = 1 and s\ = 0), and defect-zero 
(c\ = s\ = 0). Suppose players are hardwired to play either 
action. However, scores change. Depending on the scoring 
mechanism, the action vector c in period-f implies a score (or 
a probability distribution of scores) for period t + 1. Assume 
that in some period all four action-score pairs are represented 
by positive population proportions, pchPoi, Pco ,Pdo- Write 
PCtPd for the proportions of cooperators and defectors. Sup¬ 
pose the action proportions grow/ shrink, for action a = 
{C, D}, given by a replicator equation similar to that in the 
standard form: dpa/dt = p\^ * (tToi - tt) -f (TTao - If), 
where -Was is the expected payoff to action-score pair as and 
7f is the average population payoff. 

Stability of unconditional defection under all scoring rules 

Suppose all players j in period t, independent of their score, 
defect except for one player i who plays c* = 1. W.l.o.g., 
suppose s* = 1. No matter what i’s score, the payoff to i 
representing PC'! is ttci = r/s < 1, while the average payoff 
is 7f = 1 -f -^fE^ir/s) > 0. Hence, dpc/dt < 0 and therefore 
any such process is stable at pc = 0. 


Stability of high cooperation levels under image scoring 

Suppose the four different strategies at time t have mass of 
PCi,PDi,Pco,PDo respectively such thatpui = 0. We shall 
now show that there exists a starting state with pc > {s—l)/s 
such that dpcidt = 0. Suppose thatpci = (n — s)/n. Then 
TTci = r, TTco = {s — n * pDo) * {r/s) and ttdo = 1-1- 
{s -n* pdo) * (r/s). dpc/dt = 0 if ^ *r + * 

(s - n * Pdo) * {r/s) = poo * (1 -f (s - n * poo) * (r/s)). 
This yields pDO = 1/(1 — r -f n * (r/s)). Hence, a high 
cooperation level of pc = is stable if pci = 

{n - s)/n, pDi = 0, PCO = (s/n) - 1/(1 - r -f n * (r/s)), 
and Pco = 1/(1 — r + n* (r/s)). Notice that the temporary 
growth in pci relative to pco is compensated by the score 
dynamics which imply that exactly the proportion by which 
Pco will be replaced by the growth in pci. Notice also that 
these proportions are robust to a small increase in pci = 0 as 
the overproportional growth of HI will result in scores of zero 
in the next period (all His will turn HO in the next round), and 
then lead to a shrinking of HO. 


Instability of high cooperation levels under group scoring 

At first sight, the state with pci = {n — s)/n, pci = 0, 
PCO = {s/n) - 1/(1 - r + n* {r/s)), and pno = 1/(1 - 
r + n* {r/s)) could be stable. Indeed, provided that pci = 0 
this state is stable against growth of proportions pci, pco and 
PDO- However, any small increase in pci destabilizes the 
whole system. This is because, given pci, pco and pco. the 
non-zero proportion pci not only grows overproportional, but 
also - and this is the key difference with image scoring- is 
likely to keep a score of one, and will therefore also not shrink 
in the next period. Hence, the growth of Cl slows down while 
HI keeps growing. There can be no stable state with a positive 
proportion pci as both pci and pco would continue to grow 
faster than their cooperation counterparts. 
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